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Abstract 

The Majority is Stablest Theorem has numerous applications in hardness of approximation and social choice 
theory. We give a new proof of the Majority is Stablest Theorem by induction on the dimension of the discrete cube. 
Unlike the previous proof, it uses neither the "invariance principle" nor Borell's result in Gaussian space. The new 
proof is general enough to include all previous variants of majority is stablest such as "it ain't over until it's over" 
and "Majority is most predictable". Moreover, the new proof allows us to derive a proof of Majority is Stablest in 
a constant level of the Sum of Squares hierarchy. This implies in particular that Khot-Vishnoi instance of Max-Cut 
does not provide a gap instance for the Lasserre hierarchy. 
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1 Introduction 



The proof of the Majority is Stablest Theorem HMOOIOI affirmed a conjecture in hardness of approximation IIKKMO071 
and in social choice |Kal02 |. The result has been since extensively used in the two areas. One of the surprising features 
of the proof of fMOOlOJ is the crucial use of deep results in Gaussian analysis IIBor85l and an ' Tnvariance Principle" 
that allows to deduce the discrete result from the Gaussian one. 

Since the statement of the Majority is Stablest Theorem OMOOlOl deals with functions on the discrete cube, it is 
natural to ask (as many have) if there is a "discrete proof of the statement that Majority is Stablest. In this paper we 
answer this question affirmatively and provide a short general proof of the Majority is Stablest Theorem. The proof 
does not rely on Borell's result, nor does it rely on the ' Tnvariance Principle". 

We also show how the new proof can be transformed into a "Sum of Squares" proof of the Majority is Stablest 
Theorem, thus showing that Khot-Vishnoi instance of Max-cut IIKV05I does not provide an integrality gap instance 
for Max-cut in the Lasserre hierarchy. 

1.1 Functions with low influence variables In discrete Fourier analysis, special attention is devoted to functions 
/ : { — 1, 1}" — > {0, 1} with low influences. The ith influence of / is defined by 

Iiifi(/) = P[f{xi, . . . ,a;„) 7^ /(xi, . . . ,Xi^i,-Xi,Xi+i, . . .,Xn)], (1) 

where P denotes the uniform distribution on the discrete cube. 

Functions with low influences have played a crucial role in the development of the theory of discrete Fourier 
analysis. Starting with Kahn, Kalai, and Linial IIKKL88I ITal94l IFK96I . the use of hyper-contractive estimates applied 
to low influence variables is one of the main techniques in discrete Fourier analysis. 

Of particular interest are functions all of whose influences are low. The work of Friedgut and Kalai IIFK96II 
shows that low influence functions have sharp thresholds. Central work in theoretical computer science IIBou02l 
IDS05I [ST06I pointed to the importance of low influence functions, including in the context of the "Unique Games 
Conjecture" ||Kho02t jKKMOOTll . Such functions have also attracted much interest in the theory of social choice, see 
e.g. IFM98IIKal04l 

In the context of voting it is natural to exclude voting schemes that give individual voters too much power. The 
same is true in the theory of hardness of approximation where a central concept is to distinguish between functions 
that really depend on many variables versus those who have a strong dependency on a small number of variables, see 
e.g. IHas97llKho02l lDS05l. 

The Majority is Stablest theorem has been crucial in developments in both hardness of approximation and the 
theory of social choice. The theorem considers the correlation between f{x) and f{y) where x,y £ {—1, 1}" are p- 
correlated vectors with p > 0. AssumingE[/] = 1/2, the function that maximizes E[/(x)/(j/) + (l — /(a;))(l — /(y))] 
is a dictator function. The majority is stablest theorem states that for functions with low influences the value of 
E[/(x)/(?/) + (1 — f{x)){l — f{y))] cannot be much larger than the corresponding value for the majority function. 
More formally. 

Definition 1.1. For p S (—1, 1), the noise stability of f : {— 1, 1}" — > R af p is defined to be 

Stab,(/) E[/(x)/(y) + (1 - f{x)){l - /(y))], 

when [x, y) G {—1, 1}" x {^1, 1}" is chosen so that (xi, yi) G { — 1, 1}^ at'e independent random variables with 
'E[xi] = Y^[yi] = Qand'E[xiyi\ = p. 

Theorem 1.2. "Majority Is Stablest" HMOOIOI Let < p < 1 and e > be given. Then there exists r > such 
that if f : {-1, 1}" [0, 1] satisfies E[/] = 1/2 andlnf^^f) < t for all i, then 

Stabp(/) < 1 - ^^^^^^ + e. 

By Sheppard's Formula ||She991 , the quantity 1 - H£^1p is precisely lim„^oo Stabp(Maj„), where 

n 

Maj„(xi,...,a;„) = sign(^a;,;), 

We also remark here that Theorem 11 .21 readilv generalizes to the case when E[/] ^ 1/2 with the right hand 
side replaced by the corresponding quantity for a shifted majority with the same measure. This statement of Majority 
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is Stablest was conjectured in IIKKMO07I in the context of hardness of approximation for Max-Cut. By assuming 
that Theorem 11.21 holds, the authors showed that it is 'Unique Games-hard' ' to approximate the maximum cut in 
graphs to within a factor greater than .87856. . . . This result is optimal, since the efficient algorithm of Goemans 
and Wilhamson IIGW95I is guaranteed to find partitions that cut a .87856. . . fraction of the maximum. A closely 
related conjecture (for p = —1/3) was made by Kalai in the context of Arrow's Impossibility Theorem IIKal02ll . The 
results of IMOOIOII imply Kalai' s conjecture and show that Majority minimizes the probability of Arrow's paradox 
in ranking 3 alternatives using a balanced ranking function /. See IIKal02l IMOOIOII for more details. 

The statement of the theorem deals with Boolean functions, yet the proof of IMOOIOII crucially relies on Gaussian 
analysis as (a) it uses a deep result (with a hard proof) of Borell [BorSSl] on noise stability in Gaussian space and 
(b) it uses the invariance principle developed in IMOOIOII that allows to deduce discrete statements from Gaussian 
statements. This raises the following natural (informal) question: 



Question: Is there a "discrete" proof of Majority is Stablest? 



In other words, does there exist a proof of Majority is Stablest not using Borell' s result? or any other result 
in Gaussian space? We note that almost all prior results in discrete Fourier analysis do not use Gaussian results. 
In particular, the classical hyper-contractive estimates IBonTOl |Bec75ll are proved by induction on dimension in the 
discrete cube. Moreover, most of the results in the area starting from KKL including IIKKL881 |Tal94| IFK961 |Bou"02| 
do not require sophisticated results in Gaussian geometry. 

In our main result we provide a positive answer to the question above. Informally we show that 



Main Result: There is a proof of Majority is Stablest by induction on dimension. 



Our proof is short and elegant and involves only elementary calculus and hyper-contractivity. The main difficulty 
in the proof is finding the right statement to prove by induction. The induction statement involves a certain function J, 
which was recently used in the derivation of a robust version of Borell' s result and Majority is Stablest IIMN12II using 
Gaussian techniques and the invariance principle. 

In a way, our results here are an analogue of Bobkov's famous inequality in the discrete cube IIBob97ll . Bobkov 
proved by induction a discrete functional inequality that at the limit becomes the Gaussian isoperimetric inequality. 
Moreover, Bobkov' s functional is crucial for the semi-group proof of the Gaussian isoperimetric inequality. In IIMN12II 
a functional version of Borell' s result is defined and proved using the "semi-group" method in Gaussian space. Here 
we prove a discrete version of the same functional inequality. 

It is well known that the Majority is Stablest Theorem implies Borell' s result. Here we show how this can 
be done by elementary methods only (our proof of Borell's result does not even require hyper-contractivity!). Our 
proof of Borell's result joins a number of recent proof of the result including using spherical symmetrization, see 
e.g. IIIM12L sub-additivity IIK012II and a semi-group proof IIMN12I . It is the simplest proof of Borell's result using 
elementary arguments only (HMEI uses sophisticated spherical re-arrangement inequalities, IIK012I only works for 
sets of measure 1/2 and certain noise values and IIMN12I requires basic facts on the Orenstein-Uhlenbeck process). 

Since it was proved, Theorem ll.2l was generalized a number of times including in IIDMR06I IMos 1 01 . The results 
and their generalization have been used numerous times in hardness of approximation and social choice including 
in IIAus07IIOW08l RagOS IMosl2llFKN081 . Our simple proof extends to cover all of the generalization above. It also 



enables to prove an SoS version of the statement of Majority is Stablest, thus answering the main open problem of 
IIOZ13I as we discuss next. 

1.2 Sum of Squares proof system We now discuss an application of our new proof of Majority is Stablest to 
hardness of approximation. To discuss the application, we will first need to introduce the "Sum of Squares" (SoS) 
proof system. In a nutshell, the SoS proof system is an algebraic proof system (introduced by Grigoriev and Vorob- 
jov IGVOlll ) where constraints are encoded by polynomial (in)equalities and the deduction rules are specified by a 
restricted class of polynomial operations. Viewing this proof system as a refutation system for polynomial inequali- 
ties, the goal is to show that the given system of constraints is infeasible by using the allowed polynomial operations 
to arrive at a polynomial constraint which is "obviously" infeasible. 

Without further ado, we introduce the following notation: let X — {Xi, . . . be a sequence of variables 
and let M.[X] be the ring of polynomials on X. Let A = {pi > 0, . . . ,pm > 0} be a set of constraints (on X = 
{Xi, . . . , Xn))- Also, let M[X] C M.[X] be the set of polynomials which can be expressed as sums-of-squares. In 
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other words, q G M[X] if and only if q ~ rf + . . . + r"^ where ri, . . . , G For S C [to], we use ps to denote 

riiesPi withp0 = 1. Now, suppose that for all S C [m] there exists qs e M[X] such that 

-1 = ^ PS ■ qs 

SC[m] 

Then, it is clear that the constraint set A is infeasible over M". The surprisingly powerful theorem of Stengle IISte74l 
(and earlier shown by Krivine IIKri64l ) shows that whenever A is infeasible, such a certificate of infeasibility always 
exists. This theorem is known as Stengle' s Positivstellensatz. In fact, provided a certain compactness condition holds, 
the certificate of infeasibility (i.e. the set {qs : S C [to]}) can always be assumed to have qs ~ for |5| > 1; this is 
due to Putinar IIPut93l . 

While these results were well-known in the algebraic geometry community and are intimately tied to Hilbert's 
seventeenth problem IIHil88l . the interest in the theoretical computer science community is relatively new. The first 
to view Stengle' s positivstellensatz as a proof system for refutation were Grigoriev and Vorobjov IGVOll (It should 
be mentioned that an earlier paper ILMR96I also considered the proof theoretic aspects of Positivstellensatz but no 
attempt was made to quantify the complexity of such proofs). From the point of view of complexity theory, it is 
interesting to consider restricted proof systems where one only looks at proofs of refutation where maxdeg(p5 • 
Qs) < d. We refer to this as the degree-c? SoS hierarchy. This is essentially the dual of d/2-level of the Lasserre 
hierarchy ILasOll . 

The reason to consider the degree-d SoS hierarchy is that while one loses completeness (i.e. infeasible constraint 
sets A may not have a proof of refutation in the degree-d SoS hierarchy for a fixed d), the degree-rf SoS hierarchy is 
effective in the sense that if the set A has a proof of infeasibility of degree d, then it can be found in time 0{m ■ rfi'^'^^) 
using semidefinite programming (see Parrillo IParOOII and Lasserre ILasOlll ). It should be mentioned that the so called 
Lasserre hierarchy ILasOll and the SoS hierarchy are essentially duals of each other. So, for the subsequent discussion, 
whenever we use the term Lasserre hierarchy, we mean the Lasserre / SoS hierarchy. 

Given that the degree-d SoS hierarchy is automatizable, several researchers tried to understand the limitations of 
its power. Grigoriev IGriOll showed linear lower bounds for proofs of refutation of Tseitin tautologies and the mod 2 
principle. The latter result was essentially rediscovered by Schoenebeck in the Lasserre world independently ISchOSL 

Applications to hardness of approximation: While the results of Parillo llParOOl and Lasserre ILasOll have been 
known for more than a decade, there were only a few works in the theoretical computer science community which 
harnessed the algorithmic power of IParOOIlLasOll (see IIBRSlllfCSOSI ). In fact, for the results which did use Lasserre 
hierarchy, it was not clear if the full power of Lasserre hierarchy was required, or whether weaker hierarchies, like the 
one of Lovasz and Schrijver, would suffice. 

However, in a recent exciting paper, Barak et al. lBBH+121 used the degree-8 SoS hierarchy to refute the known 
integrality gap instances for Unique Games IKV05I IRS091 IKPS 1 01 . In other words, there are degree 8 SoS proofs 
which can be used to certify that the true value of the integrality gap instances is o(l). This is interesting for two 
reasons. The first is that even after a decade of intense investigation, these integrality gaps remained essentially the 
only evidence towards the truth of the Unique Games Conjecture (UGC). Thus, the SoS hierarchy discredits these 
instances as evidence towards the truth of the UGC. The second reason is that these integrality gaps were known 
to survive 17 ( (log log n)^/^) rounds of weaker hierarchies like "SDP + Sherali Adams" IRS09II or "Approximate 
Lasserre" IKS09II . Thus, this showed a big gap between the Lasserre/SoS hierarchy and the weaker hierarchies like 
"SDP+Sherali Adams" or "Approximate Lasserre". 

We now mention the main idea behind showing that degree-8 SoS hierarchy refutes the known integrality gap 
instances for Unique Games IKV05I IRS09I IKPS 101 . Analyzing the true optimum of these instances uses tools from 
analysis like hypercontractivity IBon70llBec75L the KKL theorem IKKL88II etc. Hence, to show that degree d-SoS 
hierarchy can refute these instances, one essentially needs to prove SoS versions of these statements in the degree-d 
SoS hierarchy. Note that so far we have only viewed the SoS as a refutation system, but in fact, as we will see a 
little later, there is an easy extension of th e earlier de finition, which formalizes the notion of proving a statement in 
the degree d-SoS hierarchy. In particular, llBBH+121 prove SoS versions of results like hypercontractivity, small-set 
expansion etc. 

Extending the results of |BBH+12|, O'Donnell and Zhou IIOZ13I analyze the problems "upward" of unique 
games like MAX-CUT and BALANCED-SEPARATOR. In particular, IOZ13I refutes the integrality gap instances 
of balanced separator from DKSV06I . Since the key to analyzing the optimum of the BALANCED-SEPARATOR 
instances in IIDKSV06I is the KKL theorem IKKL88I . the authors provide a proof of the KKL theorem in the degree-4 
SoS hierarchy. For MAX-CUT, their results are somewhat less powerful. Again, here they analyze the instances of 
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MAX-CUT from IIKV05I . More precisely, for any p £ (-1, 0), IIKV05I construct gap-instances of MAX-CUT where 
the true optimum is arccos p/n + o{l) whereas the basic SDP-optimum is (1 — p)/2 + o(l). The key to analyzing the 
true optimum is the Majority is Stablest theorem of BMOOIOI . Thus, to refute these instances completely i.e. show 
that the true optimum is arccos p/n + o(l), the authors essentially needed to prove the Majority is Stablest theorem in 
some constant degree-rf SoS hierarchy. While the authors could not prove that, they do manage to prove the weaker 
"2/7r" theorem from | KKMO 07 1 in (some constant degree of) the SoS hierarchy. This implies that the SoS hierarchy 
can certify that the true optimum is at most (1/2 — p/n) — (1/2 — l/n)p^. They left open the problem of refuting 
this gap instances optimally i.e. showing that constant number of rounds of the SoS hierarchy can certify that the 
true optimum of these gap instances is arccos p/tt + o(l). In this paper, as the main application of the new proof of 
Majority is Stablest, we resolve this problem. 

It should be mentioned here that while the new proof of Majority is Stablest is more suitable for the SoS hier- 
archy, several powerful theorems and techniques are needed to achieve this adaptation. For example we use results 
from approximation theory ||Lor86l and a powerful matrix version of Putinar's Positivstellensatz IILasIOII to prove 
that a certain polynomial approximation preserves positiveness. We mention here that unlike the previous two papers 
IIBBH+1 2 0Z13I connecting SoS hierarchy with hardness of approximation, we make essential use of Putinar's Pos- 
itivstellensatz (i.e. essentially the completeness of the SoS hierarchy). The following is the main theorem concerning 
the power of SoS hierarchy on MAX-CUT instances. 

Theorem. SoS-version of MAX-CUT For every 6 € (0, 1) and p G (—1,0), 3d = d{5,p) such that the degree- 
d SoS hierarchy can certify that the MAX-CUT instances from hKV05\l with noise p have true optimum less than 
arccos p/i: + S. 

As the key intermediate theorem, we establish a SoS version of the well-known version of the Majority is Stablest 
theorem (for p E (—1, 0)) which is stated next informally. 

Theorem. SoS-version of Majority is Stablest For eve G (0,1) and p <E {—1,0), there are constants c ~ c{S, p) 
and d — d(6, p) such that the following is true : Let < f{x) < I for all x G { — 1, 1}" and max; Infi(f) < r. There 
is a degree-d SoS proof of the statement Stab^ (/) > 1 — arccos p/tt — S — c ■ t. 

Our proof can be easily modified to give the analogous statement of Majority is Stablest when p E (0, 1). Of 
course, we have to change the direction of the inequality as well as impose the condition that E[/] = 1/2 (this 
condition is not required when p E (—1,0)). 

As the reader can see, the theorem is stated very informally. This is because SoS proofs are heavy in notation 
and its difficult to express the precise statement without having the proper notation. However, we do remark that SoS 
version of MAX-CUT follows easily by composing the proof of refutation of UNIQUE-GAMES instances of IIKV05II 
(done in rtBBH+I2l ') along with the I.KKMO07 I reduction (the proof of soundness of this reduction is the step where 
we require the SoS version of Majority is Stablest). 

2 Sum of Squares hierarchy 

In this section, we formally give an introduction to the Sum of Squares (hereafter abbreviated as SoS) hierarchy. To 
define the SoS hierarchy, let X = (xi, . . . , Xn) and let K[X] be the ring of real polynomials over these variables. 
We also let R<rf[X] denote the subset of R[X] consisting of polynomials of total degree bounded by d. As before, 
let M.[X] C M[X] be the set of polynomials which can be expressed as sums-of-squares. We next define a set of 
constraints given as : 

• A, = {pi{X) = 0,P2iX) = 0, . . .,PmiX) = 0} 

. Ag = {qi{X) > 0, q2{X) > 0, . . . , QiiX) > 0}. 

Before we go ahead, we define the set A4n.d[X] as the set of monomials over xi, . . . ,Xn of degree bounded by d. 
Also, let M.<d [X] denote the subset of M[X] of polynomials of degree bounded by d. Further, if A = AeU Ag, define 
V(A) = {X : A holds on X}. We next define the (degree d) closure of these constraints.. 

CdiAe) = {pAX) ■ p{X) : s G H, PiX) G X„,d[X] and deg(p) + deg(p,) < d} 
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CdiAg) = {J] (X) : ai, . . . , a„ e Z+ and ^ a, • deg(<j,;) < d} 



Note that includes 1 e 



Fact 2.1. Given and Ag as described above, the sets Cd{Ae) and Cd{Ag) can be computed in time 



,0(d) . , 



It is obvious that without loss of generahty, we can impose the constraints : p{X) G Cd{Ae), p{X) — and for 
q{X) e Cd{Ag), q{X) > 0. Next define, 

Cd{A) = {p{X) = : yp{X) e Cd{A,)} U {q{X) > : yq{X) e Cd{Ag)} 



Definition 2.2. For the constraint set A ^ A^ U Ag defined above and h(X) G IR[X], we say A h{X) > if and 
only if 

h{x)= ^P-pix)+ 

p(X)eCd(Ae) 9(x)eCd(Ag) 

where ap G M, G M[X] and for all q{X) G Cd{Ag), dcg(rq) + dcg(g) < d. In this case, we say that A degree-d 
SoS proves h{X) > 0. 

For the constraint set A, we say that A — 1 > if and only if there exists 

-1= J2 "P-M^)+ E ^^w-'jW 

p(x)eCd(Ae) 9(X)eCd(A,) 

with the same constraints on ap and as above. In this case, we say that there is a degree-d SoS refutation of the 
constraint set A. 

Note that we are adopting the same notation as in IIOZ13I . The reason we are interested in Definition 12.21 is 
because one can efficiently decide if A — 1 > using semidefinite programming. This is because deciding if 
A \-d —1 > is equivalent to refuting the existence of a map E : S.<:d[X] M satisfying the following conditions 
(see UParOOl for more details) 

• E{1) = 1. 

• It is a Hnear map i.e. for every g, h G ]R<d[X] and a, /? G M, E{ag + (3h) = aE{g) + l3E{h). 

• For every h G Cd{Ae), E{h) = 0. 

• For every h G Cd{Ag) and g G M<d[X], such that deg(g ■ h) < d, E{g ■ h)>0. 

A map E which satisfies all the above constraints is called a degree-d SoS consistent map for the constraint set 
A ~ Ae U Ag. Lasserre ULasOIl and Parillo QParOOl have shown that using semidefinite programming, it is possible 
to decide the feasibility of such a map E in time m • •nP^'''\ In fact, if there exists such a map E, then the algorithm 
outputs one in the same time. It is important to mention that since E has an infinite domain, it is not obvious what one 
means by outputting the map. To see why this makes sense, note that i? is a linear map and hence it suffices to give to 
specify E on the set M.n,d[X]. We also remark here that the notion of finding a mapping E is close to the viewpoint 
taken by Barak et al. rtBBH+12l . 

To get started with SoS proof systems, we state a few facts (which are very easy to prove) : 

Fact 2.3. • If A p > and A' ^d' q > 0, then AU A' l-,„ax{d,d'} P + <? > 0. 

• If A hrf p > and A'rd' q> 0, then A 'rd+d' p- q>0 

• If A hd {pi > 0,P2 > 0, . . . ,p,„ > 0} and {pi > 0,p2 > 0, . . . ,p,„ > 0} h^' q > 0, A hd-d' q > 0- 
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Several other SoS facts are proven in the Appendix [C] We suggest that the non-expert reader look at the Ap- 
pendix|C]to get more comfortable with the notion of SoS proofs. For rest of the paper, we set the following convention 
for indeterminates appearing in SoS proofs : Capital letters X, Y and Z will be used to denote a sequence of inde- 
terminates (i.e. X — {xi, . . . , a;„)) while small letters x, y and z wiU be used to indicate single indeterminates. This 
convention is however only for indeterminates in the SoS proofs. For other variables, both capital and small letters 
will be used. Also, we will consider polynomials on the indeterminates occurring in the SoS proofs. Whenever we 
refer to such polynomials without an explicit reference to the underlying indeterminates, the set of indeterminates will 
be clear from the context. To get the reader more acquainted with the power of SoS proofs, we state the following 
powerful result of Putinar which we use repeatedly. 

Theorem 2.4. ||Put93l Let A = {pi{X) > 0, ...,p„(X) > 0} and define M{A) = ELi + where 
rp, . . . , r„i G M[X]. Assume that 3q e M(A) such that the set {X : q{X) > 0} is compact. If p > on the 
set Y{A), then p e M{A). 

As a key step in one of our proofs, we will also require a matrix version of Putinar' s Positivstellensatz (see OLaslOl 
for details). A matrix T e (M[X])''^'' is said to be a sum-of-squares if there exists B e {R[X]y'^'^ (for some q e N) 
such that B - B'^ ^T. 

Theorem 2.5. Let A ~ {pi{X) > 0, . . . ,Pm{X) > 0} be satisfying the conditions in the hypothesis ofTheorem \2.4\ 
Let r S (R[X])P^P be a symmetric matrix and S > be such that T ^ SI on the set "V{A). Then, T = Tq{X) + 
Sl^i ^ii-^) ■ Pi{^) where Fq, . . . , r„j are sum-of-squares. 

3 Our tensorization theorem 

In this section, we will prove our main tensorization inequality on the cube. In subsequent sections, we will use it 
to give new proofs of the "Majority is Stablest" theorem of Mossel, O'Donnell and Oleszkiewicz IIMOOIOI and the 
Gaussian stability inequality of Borell IIBor85l . We begin by defining the following function from IIMNI2I for every 



Definition 3.1. Let fii and ^2 be probability spaces and fi be a probability measure on fli x D,2. We say that fj, has 
Renyi correlation at most p if for every measurable / : f2i — > M and g : — >■ M with / = <? = 0, 



For example, suppose that Vli = fl-z and suppose {X, Y) are generated by the following procedure: first choose 
X according to some distribution v. Then, with probability p set y = X, and with probability 1 — p, choose Y 
independently from i^. If fi is the distribution of {X, Y), then it is easy to check that ^ has Renyi correlation p. 

Definition 3.2. Iffl is a probability space and f is a function f2" — > M, then for X ^ il, we define fx ■ f^"^^ — > M 



p e [-1, 1] : Jp : (0, 1)2 [0, 1] as 



Jpix,y) = Prx,y[X < ^-\x),Y < ^-\y)] 



Here X, Y are jointly normally distributed random variables with the covariance matrix 




by 



fx{Xi, . . . , Xn~l) — /{Xi, . . . , Xn-l,X). 



Definition 3.3. For a function f : fl ^R, define 



Ai(/) = E|/-E/|3. 



For a function f : fi" — )■ M, define A„(/) recursively by 



A„(/) = Ex„[A„_i(/;fJ] + Ai(E[/xjX„]), 



noting that E[/x„ l^n] is a function — > M. 
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We prove the following general theorem, which we will later use to derive both Borell's inequahty and the 
"Majority is Stablest" theorem. 

Theorem 3.4. For any e > and < p < 1, there is C{p) > such that the following holds. Let fi be a p-correlated 
measure on Q.i x 5^2 <^nd let {Xi, i-i.d. variables with distribution p,. Then for any measurable functions 

/ : Vt'l ^ [e, 1 - e] and g : ^ [e, 1 - e], 

EJp(/(X),.g(r)) < Jp(E/,E.9)+C(p)e-^('')(A„(/) + A„(.9)). 

We note that IIMN12II proved that in the Gaussian setup where /, g : M" — > [0, 1] it was shownthat E Jp(/(X), g{Y)) < 
Jp(E/,E5). 

3.1 The base case We prove Theorem l3.4l bv induction on n. In this section, we will prove the base case n — 1: 

Claim 3.5. For any e > and < p < 1, there is a C{p) such that for any two random variables X,Y (Iz [e, 1 — e] 
with correlation in [0, p], 

E Jp(X,y) < Jp{EX,'EY) + C(p)e"'^('')(E \X -EX\^ + E \Y - EFp). 

The proof of Claim [33] essentially follows from Taylor's theorem applied to the function J; the crucial point is 
that J satisfies a certain differential equation. Define the matrix Mpa{x, y) by 

\ dxdy d'-^x / 



Claim 3.6. For any (x,y) E (0, 1)'^ and < a < p, AIpa{x,y) is a negative semidefinite matrix. Likewise, if 
p < (7 < 0, then AIpcr{x, y) is a positive semidefinite matrix. 

Claim 3.7. For any — 1 < p < 1, there exists C{p) > such that for any i, j > 0, i + j ~ 3, 

d^Jp[x,y) 



d^xdiy 



<C{p){xy{l-x){l-y)r^^P^ 



Claims I3T6I and 13771 follow from elementary calculus, and we defer their proofs to the appendix (Claim l3T6l is first 
proved in |MN12| and we include the proof here for the sake of completeness). Now we will use them with Taylor's 
theorem to prove Claim |33] 

Proof of Claim [Ol Fix e > and p G (0, 1), and let C(e) be large enough so that all third derivatives of J are 
uniformly bounded by C(e) on the square [e, 1 — e]^ (such a C(e) exists by Claim [3^ . Taylor's theorem then impUes 

that for any a,b,a + x,b + y & [e, 1 — e], 

dJ dJ 

Jp{a + x,b + y) < Jp{a, b) + ^^^(a, b) + V^i^^ ^) 



+ C7(p)e-^('')(a;3+y3). (2) 



Now suppose that X and Y are random variables taking values in [e, 1 — e]. If we apply (|2]) with a = 'EX, 
b = 'EiY , x = X — E X, and y — Y — EY , and then take expectations of both sides, we obtain 



EJp{X,Y) < Jp(EX,Er) 



1 



E 



(X Y) 



^ dxdy ^ 



C(p)e^^('''(E|X|3 + E|f|3) (3) 
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where X = X -E X and? = Y -EY. Now, if X and Y have correlation a e [0, p] then EXY = a^EX^EY^, 
and so 



E 



where ctjc = VeX^ and ay = VeY^. By Claim 

{ax ay) [ , 



cry 



Applying this to (O, we obtain 

E Jp(X,r) < Jp{EX,EY) + C{p)t-^^P\E\X\^ + E\Y\^) 



□ 



3.2 The inductive step Next, we prove Theorem l3.4l bv induction 



Proof of Theorem \3 .4\ Suppose that the Theorem holds with n replaced by n — 1. Consider / : fi" — > [cj 1 — e] and 
g:f}^-> [e,l-e]. 

Conditioning on y„) and writing X = (Xi, . . . , Xn-i), Y = {Yi, . . . , Yn-i), we have 
E g{Y)) = E^„^y„ E^ y Jp( fx„ (X), gy^ (Y)). 

Applying the inductive hypothesis for n — 1 conditionally on X„ and Yn, 

E^^yJp{fxAX),gy^{Y)) 

< Jp(E[/xjX„],E[5yjr„]) + C(p)e-^('')(A„-i(/xJ + A„_i(/yJ). (4) 
On the other hand, the base case for n = 1 implies that 

Ejf„,y„ Jp{E[fxjX„lE[gyjY^]) 

< J,(E/,E.g) + C(p)e-^('')(Ai(E[/xjX„]) + Ai(E[5yjy„])). (5) 
Taking the expectation of (01) and combining it with (|5]), we obtain 

EJp{f{X),g{Y))<JpiEf,Eg) 

+ C(p)e-C'(p) ( Ex,. A„-i (/x„ ) + Ey„ A„_i (/y„ ) 

+ Ai(E[/xjX„]) + Ai(E[5yjy„])). 

Finally, note that the definition of A„ implies that the right-hand side above is just 

Jp(E/,Eg) + C(p)6-^('')(A„(/) + A„(5)). 

□ 



4 Borell's inequality 

The most interesting special case of Theorem |3.4| is when ili ~ fl2 ~ { — 1, 1} and the distributions of Xi, Yi satisfy 
EXi ^EYi = Q,E XiYi = p. In this section and the next, we will focus on this special case. First, let us recall the 
functional version of Borell's inequality that was given in IIMN12II . 
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Theorem 4.1. Suppose that Gi and G2 are Gaussian vectors with joint distribution 

For any measurable /i, /2 : K"* — )• [0, 1], 

EJ(/i(Gi),/2(G2)) < J(EA,E/2). 

We will prove Theorem |4T| using Theorem [33] and a crude bound on A„(/) (in the next section, we will need a 
much better bound on A„(/) to prove that "Majority is Stablest"). 

Claim 4.2. For X e {-1,1}", define 

X ' = (^1, ■ • • , Xi^i, ~Xi, XiJ^i^ . . . , Xn). 

Then 

n 

A„(/)<^E|/(X)-/(X-')|3. 

1=1 

Proof. The proof is by induction: the base case is trivial, while the inductive step follows by Jensen's inequality: 

A„(/) = Ex„ [A„_i (/x„ )] + Ex J E[/x„ |X„] - E /|3 

< ^ E\f{X) - f{X-^)\' + Ex„ I E[/xjX„] - E[/_xJX„]|3 

i=l 
n-1 

< ^ E \f{X) - f{X-')\^ + E \f{X) - 



□ 



Proof of Theorem \4.1\ Let n = md and, for each i = 1, . . . , d, define 

m 2rn rnd 



Define G2.n similarly by with Y instead of X. By the multivariate central limit theorem, (Gi^„, G2.n) (Gi, G2) 
as TO — >■ 00. 

Suppose first that /i and /i are L-Lipschitz functions taking values in [e, 1 — e], and define 51, 52 : { — 1, 1}" by 
.9,(X) = /^(Gi.„). By TheoremEai 

EJ(.gi(X),. 92(F)) < J(E.gi,E.g2) + G(e)(A„(.gi) + A„(.g2)). (6) 

Since is i-Lipschitz, 

\g.^{X)-g,{X-^)\<^ 

VTO 



for every j, and so Claim |431 implies that 

A„(gi) < — ryj = —1^. 

Applying this to 

E J(gi (X) , g^iY)) < J(E gi , E 52) + G(6) l^^'^^^^) 
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and so the definition of gi implies 



EJ(/i(Gi,„),/2(G2,„)) < J(EA(Gi,,),E/2(G2,„)) + C(e)-^''^^^'^ 



Taking m oo, the central limit theorem implies that 

EJ(/i(Gi),/2(G2)) < J(E/i(Gi),E/2(G2)). (7) 

This establishes the theorem for functions /i and /2 which are Lipschitz and take values in [e, 1 — e]. But any 
measurable /i,/2 : [0,1] can be approximated (say in 7^)) by Lipschitz functions with values in 

[e, 1 — e]. Since neither the Lipschitz constant nor e appears in (|7]l, the general statement of the theorem follows from 
the dominated converaence theorem. □ 



5 Majority is stablest 

By giving a bound on A„(/) that is better than Claim l4~2l we can derive the "Majority is Stablest" theorem from 
Theorem 13.41 Indeed, we can express A„(/) in terms of the Fourier coefficients of /, and we can bound A„(/) in 
terms of the max influence of /. For this, we will introduce some very basic Fourier analytic preliminaries below. 

Fourier analysis: We start by defining the "character" functions i.e. for every 5 C [n], define xs{x) : { — 1, 1}" ^• 

M as xs{x) = Yiies ^i- Now, every function / : {— 1, 1}" R can be expressed as 

f{x)=Y.f(^^^s{x) RS)^ E [f{x)-xsix)] 

The coefficients f{S) are called the Fourier coefficients of / and the expansion of / in terms of f{S) is called the 
Fourier expansion of /. It is easy to show that X]sc[ti] /^('^) = Exef-i.i}" [/^(a^)]- This is known in literature as 
Parseval's identity. Similarly, for any p £ [—1, 1] and x E { — 1, 1}", we define y x as the distribution over 
{ — 1,1}" where every bit of y is independent and E[xiyi] ~ p. This immediately lets us define the noise operator 
Tp as follows : For any function / : { — 1, 1}" M, Tpf{x) = Eyr^^xlfiv)]- The effect of the noise operator Tp is 
particularly simple to describe on the fourier spectrum. Tpf{S) = p^^^f{S). The reader is referred to the excellent 
set of lecture notes by Ryan O'Donnell lO'DOTI for an extensive reference on this topic. 

It is also important to remark here that while we prove the ' 'Majority is Stablest' ' theorem for the hypercube with 
the uniform measure, one can easily derive analogues of this theorem for more general product spaces by extending our 
machinery. Instead of using the fourier expansion of the function, one has to use the Efron-Stein decomposition (see 
the lecture notes by Mossel IIMos05l for an extensive reference on the Efron-Stein decomposition). All the statements 
that we prove here have analogues in the Efron-Stein world. We leave it to the expert reader to fill in the details. 

We start by extending the notation of Definition 13. 21 



Definition 5.1. For disjoint sets S,T C [n], and elements x e { — 1, 1}'^:?/ <= {^li 1}"^. write x ■ y for their 
concatenation in {—1, 1}'^'^"^. 

For a function / :{— 1,1}"— >R, a set S C [n], and an element a;G{ — 1,1}'^, we define '■ {—1, l}!"!^'^ — > R 
by fx{y) = f{x-y). 

Our first observation is that A„ of / can be written in terms of Fourier coefficients of random restrictions of /. 
Claim 5.2. If St = {i + 1, . . . ,n}, then 

n 

1=1 

Proof. The proof is by induction. The base case is just the fact that for a function / : {— 1, 1} — > R, 

|7(1)I'= /(l)_iLi)'^E|/-E/p. 
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For the inductive step, we have 



A„(/) = Ex„[A„_i(/xJ] + Ai(E[/;fjX„]) 



E 



^Ex.+,,...,x„_J/xW| = 

i=l 



\fin)\' 



□ 



In order to control the Fourier coefficients of restrictions of /, we can write them in terms of the Fourier coeffi- 
cients of /: 

Claim 5.3. For any disjoint S and U and any a; €{ — 1,1}'^, 

TCS 

Proof. Fix S and x. Let g : { — 1, 1}" ^ M be the function such that g{y) = 1 when yi = Xi for all i ^ S, and 
g{y) — otherwise. It is easy to check that the Fourier expansion of g is 

g{y) = 2-1^1 XTix)xT{y). 

TcS 

Then 

fxiU) = Ex[„,\s fx{X[n]\s)XuiX[n]\s) 

= 2l^l Ex f{X)g{X)xuiX) 

= Ex f{X) XTix)xT{X)xuiX) 

TcS 

= Y^T{x)f{Tyju). 

TcS 

In particular, the identity of Claim |53] allows us to compute second moments of fx- 
Claim 5.4. For any function f : { — 1, 1}" — > M, any x G { — 1, 1}'^ and any i £ U CZ [n], 

Exe{-i,i}H/^(f/)l' <Inf.(/). 

Moreover, if Si ^ {i -\- 1^ . . . ^n} then 

n 

Y^xe{-i,i}s. |/^W|'-Var(/). 
i=i 

Proof. In view of Claim 1531 we can write 

l/^(C^)l'= E XTiXs)xT'{Xs)f{TUU)f{T' UU). 

T.T'CS 

When we take the expectation with respect to Xs, E Xt{Xs)xt' (Xs) = St.t' and so the cross-terms vanish: 

Ex. E?(ruC/). (8) 

TCS 



□ 
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Since Inf ;(/) = X^Tgi /^(^)' '^he first part of the claim follows. 
For the second part, 

n n 

J2 Exe{-i,i}s, Ihcm' = E E /"(^ U {^}) = E /'(^)' 

i=l i=l TcSi UCln],U^lll 

where the last equality used the fact that every non-empty U C [n] can be written uniquely in the form T U {i} for 
some T C {i + 1, . . . , n}. But of course the right-hand side of (|9]l is just Var(/). □ 

Next, we will consider fxin — i) as a polynomial in x and apply hypercontractivity to the right hand side of 
Claim 1521 First, note that T(j commutes (up to a multiplicative factor) with restriction: 

Claim 5.5. For any < ct < 1, if S, U C [n] are disjoint then, as polynomials in x = {xi)i(=s, 

(T7j).{U)=a\''\T,{l{U)). 

Proof. Bv Claim 1531 



Uu)=Y.^T{x)f{T\JU). 



□ 

-2 



Since J\J(T \JU)= crl^l+l'^l/(S') and T„xt{x) = (t^^^Xt, it follows that 

{TW).{U) = E 't"^'+"''xt(x)/(TU [/) = al^lr.(/,(C/)). 

TCS 

Essentially, Claim |531 allows us to apply the Bonami-Beckner inequality to fx- for any o'<l, ifp=l + (T 
then 

Exe{-i,i}- liTjTxm" < (Ex lIxm^T^^- 

By ClaimEl if ieU then 

Bxe{-is}s \{TWTx{UW < (Inf,(/))^(Ex \fxiU)\'). 
Applying this to Si = {i + 1, ... ,n} and Ui = {i} and summing the result over i = 1, . . . , n, we obtain 

EExe{-i,i}-. \iTJfxi^)\'' < (maxlnf,(/))^ Var(/). 

Now, if / takes values in [—1,1] thenVar(/) < 1 and all Fourier coefficients of / (and its restriction) are bounded by 
1. Hence, Claim ls!2] implies the following: 

Claim 5.6. //I + cr"^ < 3 then 

A„(T,/) < (maxlnf,(/))^. 

i 

Now we are ready to prove the "Majority is Stablest" theorem. For this, we define as 

Sp(/)=E,e{-i,i}.,,^,.[/(x)/(y)] 



Theorem 5.7. ForanyO < p < 1, there are constants < c{p),C{p) < oo such that for any function f : { — 1, 1}*^ 
[0, 1] with maxi Infi(/) < r, 

S,(/)<J,(E/,E/) + C(p)i^|M^. 
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As remarked earlier, our proof extends to the generalizations of Theorem |5 . 7 I such as those presented by IIDMR06I 
IMoslOI . The extension of the proof uses the Efron-Stein decomposition instead of the Fourier decomposition. The 
only difference is that the hyper-contractivity parameter will now depend on the underlying space. See IIMoslOl for 
more details. 

Proof. Suppose / ; { — 1, 1}" — > [e, 1 — e] satisfies max^ Infi(/) < r, and let X, Y be uniformly random elements of 
{—1, 1}" with EXiYi = p. For sufficiendy small i] > 0, Claim |5^ impUes that 

Note that the range of Ti_^/ belongs to [e, 1 — e] because the range of / does. Hence, Theorem |3 .41 applied to Ti-r;/ 
implies that 

E Jp(Ti_,„/(X),ri_,,/(r)) < Jp(Eri_„/,ETi_,„/) + A„(/) < J,(E/,E/) +Ce-^(^)T-''. 
Since Jp{x, y) > xy, it follows that 

§p(i„^).(/) = Eri_„/(x)ri_„/(r) < jp{Ef,i]f) + Ce-^^p^T-^. 

This inequality holds for any < p < 1; hence we can replace p{l — r/)^ by p to obtain 

Spif) = ETi_„/(X)Ti_,,/(r) < Jp(i_„)-.(E/,E/) + Ce-C'('')T='' (10) 

for any p < (1 — 77)^. 

Now, ( [Tol l holds for any /:{—!, 1}" [^i 1 ^ e]- For a function / taking values in [—1, 1], let be / truncated 
to [e, 1 - e]. Since | E - E /| < e and (by the proof of Claim IIS ^"^^ < 1 for any p, 

Jpii-n)-- iEr,En< Jp(i_„)-2 (E /, E /) + 2e. 

On the other hand, | / — /^| < e and so 

§p(/) = E/(X)/(r)>§p(r)-2e. 
Thus, ( [Tol l applied to /' implies that for any p < (1 — 77)^ and any e > 0, 

Sp(/) < Jp(i-,)-.(E/,E/) + 2e + Ce-^('')r=''. 

If we set e = W(2C(P)) then 

§p(/) < Jp(i-,)-^ (E /, E /) + Cr^W". 
Finally, some calculus on Jp (see Claim lATTT i shows that | ^'^''gp'^'' | < ( \/l — for any x, y; hence 

§p(/) < Jpii-n)-^ (E /, E /) + ^^('^^^2)372^ + ^ '^^(E /, E /) + C(p)(ry + r^")"). 

Choosing /y = C{p) completes the proof as long as p < (1 — 77)^. However, we can trivially make the 

theorem true for (1 — 77)^ < P by choosing C{p) and c{p) appropriately. □ 



6 SoS proof of Majority is Stablest 

The principal theorem of this section is the SoS version of "Majority is Stablest" theorem of OMOOIOI . Before we 
state the theorem, we will need a few definitions. We will consider the indeterminates f{x) (for x G {—1,1}"). The 
constraints on these indeterminates is given by 

= {0 < f{x) < 1 : for all x e {-1, 1}"} 
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. As is the case with the usual setting, its helpful to define the fourier coefficients of /. 

For5C[,^] 7(5)= E f{x)-xsix) and hence /(x) - V 7(^)xs(a:) 

Note that f{S) are nothing but linear forms in terms of the original indeterminates. It is also helpful to recall the 
notion of influences and low-degree influences in this context. 

S3i S3i:\S\<d 

With this, we state the main theorem of this section. 

Theorem 6.1. For any n > and p £ (—1,0), 3do — do{K^ p), di — di{n, p) and c — c{k,, p) such that 

1 " 

A, E [/(.t) • f{y) + (1 - fix)) ■ (1 - f{y))] > 1 - - arccosp - k-c- (^(Inf^V)') 

^e{-i,i}" 7^ 

This is easily seen to be equal to the statement of the Majority is Stablest theorem of IMOOIOI . Before, we delve 
further into the SoS proofs, we feel its good to familiarize ourselves with the fourier machinery in the SoS world. The 
upshot of the discussion ahead is going to be that the basic fourier identities and operations hold without any changes 
in the SoS world. In particular, it is easy to verify that Parseval's identity holds i.e. for {/(.t)} and {/(5)} defined as 

above E[/2(x)]=Escn/'(5)- 

Similarly, we can define the noise operator Tp here as follows : Given the sequence of indeterminates {f{xY\x£{-i,i)"^^ 
we define the sequence of indeterminates {g{x)} as 17(2:) = 'Eyr^ ^x[f {x)] and for every x, use Tpf{x) to 
refer to g(x). It is also easy to check that if we define 3(5*) ='E,x[g{x) ■ xs{x)], t\\&n'g{S) = p^^^f{S). 

6.1 Smoothening the function For our purposes, it is necessary to do a certain smoothening of the function /. In 
particular, we start by considering a new function /2 i.e. we create a new sequence of indeterminates defined by 
f2{x) = (1 — i)f{x) + e/2 for some e > 0. The value of e shall be fixed later. We observe that 

Ap hi U,g{_i,i},.{e < h[x) < 1 - e} (11) 

/i(5) = (l-e)/(5) + (6/2).ls=* 
We also make the following claim (the proof is deferred to Appendix iDb. 

Claim 6.2. 

Ap h2 f{x)f{v) - 2e < h{x)h{y) < f{x)f{v) + 2e 

The next stage of smoothening is done by defining g = ri_,,/2 for some 77 > 0. Again, the value of 77 will be 
fixed later. 

Uxei-i,!}" {e < hi^) < 1 - e} ^1 U,6{_i,i},.{e < g{x) <l-e) (12) 

\s\ 



g(5) = (l-e)(l-r/)l^l/(5) + (6/2).l5=* 

Also, observe that "Ex^yr^i^xlhix) ■ /2(y)] = '^x,yr^p,x[g{x) ■ <?(?/)] where p' = p/(l — 77)^. Of course, this imposes 
the condition \p\ < |1 — 77^. So, we have to choose jy to be small enough. Now, define the constraint set A'^ = 
Ua;g{_i.i}n{e < g{x) < 1 — e}. So, we summarize the discussion of this subsection in the following two claims. 

Claim 6.3. For any q and d <E N, if A^ \-d q > 0, then Ap\-d q > 0. 

The proof of the above is obtained by combining ( fTTT i and ( fT2] i with the third bullet of Fact l2.3l The second claim 

is 

Claim 6.4. 



Ap h2 Kx,y^^4f{x) ■ f{y)] > Kx,y^,x[g{x) ■ g{y)] - 2e 
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Thus, the above two claims mean that from now on, we will work with A'^ and aim to prove a lower bound on 
'^x,yr^p,x[g{x) ■ g{y)]- At this stage, let Jpi be the approximation obtained from Claim lBTI with parameter e > and 
d = e. For the sake of brevity, we indicate this by J itself. The following claim allows us to compare the terms x ■ y 
and J{x, y). 

Claim 6.5. For any e > 0, such that p' G (—1,0) and J is as described above, there is a da — da (e, p') such that, 

{e<a;<l — e,e<y< 1— e} h^^ x ■ y > J{x, y) — 2e 

Proof. Note that for {x,y) £ (0, 1)^, Jo(x,?/) = xy and hence by Slepian's lemma, we get that if p' < 0, then 
xy > J pi [x, y). Now, by definition, we have that for (x, y) G [e, 1 — e]^ , a;?/ > J{x, y) — e. In other words, if we 
define the polynomial p{x, y) ~ xy ~ J{x, y) + 2e, then we know that for {x, y) G [e, 1 — e]^, p{x, y) > e. We can 
thus apply Corollarv lC.lOl to get that there is an integer da ~ da{£, p') such that for (x, y) G [e, 1 — e]^, for p' G (0, 1), 

{e < a; < 1 - e, e < J/ < 1 - e} hd, p > 

Expanding p, finishes the proof. □ 

6.2 Taylor's theorem in the SoS world Following the proof of Majority is Stablest, we now need to prove a 
Taylor's theorem in the SoS hierarchy. The following lemma is the SoS analogue of Claim [331 

Lemma 6.6. Define a sequence of indeterminates {/io(l), ho{—l)^ ^i(l), 1)}. Let A be a set of constraints 
defined as A = U,; jg{o.i}{£ ^ hi(j) < 1 — e}. For any e > 0, p' G (—1, 0), 3cj = c-y(e, p') and 3d^ = d^{^, p') 
such that 

A^d, E [J{ho{x)M{y))] > J(MO),MO))-e- (i) + /M'(i))-c^-(C(i) + /nV)) 

xeni-lS} 

y~p'X 

where h,{j) = hM+i-^Lh^ for i, j G {0, 1}. 

Proof. We start by noting that since J is a symmetric polynomial, hence we can write 

m.n:rn-\-n<K 

Here, we assume that K is the degree of J and c is the maximum absolute value of any coefficient. . We next make 
the following claim. 

Claun 6.7. 

E [J{ho{x),h^{y))]= ^rn.„-C(l)-/^"(l)- f^ + (-ir-^ 

xeR{-i,i} \ ^ 2 

y~ t x m,n:ni-\-n IS even 

where 

E-p-mi-rn -^ni-n f mi\ f Tli 

fJ-mum ■ ho (0) - hi (0) • 

\mj\n 

The proof is deferred to AppendixiDl Next, we note that i^o.o J{ho{0)^ hi{0)). Thus, we get that 

E [J{ho{x),h,{y))]-J(ho{0)Xm^ V i^m,n-C(l)-C(l)Yi^ + (-ir- 
xGr{-i,i} \ 2 

-i/^ IX m.n:m+n IS even 

" K>m+n>2 

We first make the following claim which bounds the terms when m + n > A. 



I-P' 
2 

(13) 
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Claim 6.8. 



rn,n:m'\-n is even 
and m+n>4 



where Y = 2cK'^2'^^ (h^^ (I) + {I))- 



Again, we defer the proof of Claim |6^ to Appendix iDl Thus, we are only left with the task of controlling the 
terms when to + n = 2. Note that 

• ^'"(1) ■ ^"(1) ■ + (-1)™ ■ = ^2,0 • 'ho {I) + ■ + pVi.i • 

m+n=2 ^ ^ 

For the sake of brevity, call the above quantity A. Next, we observe that at x ^ HqIO) , y ~ /ii(0) 

d^Jix, y) _ „ d^Jix, y) _ d^J{x, y) _ 

-—5 — 21^2.0 TT^ — 2j/o,2 Tr~^ ~ '^l,! 

ox'^ oy^ oxoy 

To see this, note that 

d''J{x,y) ^ ^V(M0) + a;^M0) + ^/') 

dx"^ x=ho{0).y=hl(Q) dx'"^ x'=0,v'=0 

However, the quantity on the right side is simply twice the coefficient of x'^ in the polynomial J(/io (0) +a;' , hi (0) +y') 
which is exactly 21^2,0- The other equalities follow similarly. Thus, we get that 

In the above, all the derivatives are evaluated at x = ho{0), y = /ii(0). We now make the following claim which 
gives a lower bound on A. 

Claim 6.9. For every e > and p e (—1,0), there exists d'^ = d'^ (e, p') such that 

^hd, A>~e.(/j;'(l) + /^'(l)) 

We defer this proof to Appendix|D] Now, set d~f = max{(i^ , 4A' + 3} and c~, = 20/^*2^^ . Combining Claim l6.8l 
and Claim l6.9l with ( fT3] ). we get Lemma l6^ □ 

6.3 Tensorization: We now do a ' 'tensorization" of the inequality in Lemma |63] Let {(t>{x)} be a set of 

inde terminates. We recall that for y e { — 1, 1}*, we define the set {0y(2:)}z6{-i.i}"-' of indeterminates as follows : 
<i)y{z) = 0(2 • y). As before, we can define the fourier coefficients 4>y{S) for S C [n — i] and it is easy to see that 
they are homogenous linear forms in the indeterminates (j>{S) (for 5* C [n]). We now state a few basic properties for 
the indeterminates gy{z) and gy{S). 

A'^iU U U {e<gyiz) < 1 - e} (14) 

n—1 

A'hi U U U {-1 < gy{S) < 1} (15) 

^2 E [.9;'(n-»)]= J2 9'{S) (16) 

n—i^S 



16 



Lemma 6.10. For the parameters = c^(e, p') and dj ~ ^^(e, p') from Lemma WM 

A[,^d, E [J{g{x),g{y))] > J(E[g(a:)], E[g(y)]) - e ( E,e{_i,i}. [g^^^ _ 



C. 
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^E,e{_i,i}.[5^(n-i)] 



The proof of this claim is a very simple induction. For the sake of completeness, we give the proof in AppendixiDl 
We now simplify the error terms. Towards this, note that ( fTSI l implies that 

\r=0 J s^^ s ^e{-i.i}" 

Further, A'^ I-3 Ei^g{_i,i}" [5^(2^)] < 1 (using Fact |C.2t . Thus, we get that 

E [J(5(x),g(y))] > J(E[.g(x)],EL9(y)])-e-cJ'^'E,g{_i,i}.[g^(n-z)] 1 (17) 

6.4 Bounding the error terms: Thus, all we are left to bound is the ' 'degree-4" term. We briefly describe why one 
has to be careful to get a (meaningful) upper bound here. The reason is that the obvious strategy to do this is to break 
g into high degree and low-degree parts based on the noise parameter (call them h and £). Now, this very naively 
gives an error term of the form Exh^{n — i) and Ej;i';^(7i — i). The latter can be easily bound using hypercontractivity. 
However, there does not seem to be obvious way to bound the former. This is in spite of the fact that E^/i^(n — i) is 
small. We now show how to get around this problem. 

Wedefined^ = (l/?7)-log(l/7]). Now, define the sequence of indeterminates {/i(x)}2;g{_i,i}" and {£(x)}^£{_i_i}r. 
as follows : 

h{x) = 9{S)xs{x) iix) = g{S)xs{x) 

\S\>dr, \S\<dr, 

By the way it is defined, it is clear that hi h{x) + i{x) = g{x). Now, we can analyze the term E2.g{_i ij; \g^{n — i)] 
as 

n— 1 n— 1 



h4 ^E^e{_i^i}4^(n-i)] = ^(E^e{_i.i}.[g^(n-i)(/i^(n-i)+4(n-i))]) 

4=0 4=0 

n-1 



4=0 



+ E^g{_i_ip[g^(n - i)ix{n - i)K{n - 

We begin by stating the following useful fact : 

Fact 6.11. Ap hg Er=o' ^.ei-i.iyhlin - i) < v 
Proof. 

n-1 

i=0 S \S\>d^ \S\>d„ \S\ 

Combining the two facts, finishes the proof. □ 
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We now make the following claims. 
Claim 6.12. Ap hg ^"=0^ Exe{-ia}' [gl{n - i)hx{n - i)] < 

Claim 6.13. HJo' E.e{-M}.[?^(^ " *)^.(" - < ^/^+ $i(Er=i(Inff'" (/))') 

Claim 6.14. A^ Efo' E,e{_i,i}. _ _ _ < 2^^+ ^(Elli (Inff '"(/))') 

The proofs are deferred to the appendix. Combining ( flTl i with Claim l6T2l Claim l6T3l Claim l6T4l along ( fTTI i and 
Claim |631 we get that for and d-y described in Lemma I631 

Ap^d, E [Jigix),giy))] > J(E[5(a;)], E[5(y)]) - e - 4 • c^^^ 

2;G{-1,1}" 



i=l 



Using Claim |6.5l we have that h^^ 9{^)'9{y) ^ >^(.9(2;)i 5(2/))~2e. Similarly, combining this with Claim |6.4| 
we can get that 

Ap h<i„ E,,^^^,[/(x) • /(y)] > E,,,^^,,[J(g(x),g(y))] - 4e (18) 
Thus, now applying (fTST l, we get 



E [/(x)-/(y)]> J(E[g(x)],E[g(y)])-5e-4.c^V?^~ ' ^(Inff (/))M 

(19) 

Now, define a new sequence of indeterminates {fc{x)}xe{-i,i}'^ where fc{x) = 1 — f{x). Next, define f2c{x) = 
(1 — e)fc{x) + e/2. Next, we define gdx) = 'Eijr^j^_^x[.f2c{x)]. We now make the following observations ; 

• Vx e {-1, 1}", Ap hi Vx e {-1, 1}", e < ,9c(a;) < (1 - e). 

• F,x[g{x)]+'EA9c{x)] = 1. 

• For all i e [n], Inff = Inff '^"/c- 

Thus, using the above, analogous to (fT9] i. we have the following : 

Qf^ / " \ 

Ap ^m..{d-,.ci^} E [/c(.T)-/c(y)] > Jingcix)m9ciy)])-5e-A-c,^j ^^(Inff 

(20) 

Now, define ^ as C = 5e - 4- c^^- ^^^^^ (Er=i(Inff '^"(Z))^)- Summing up (O and (EOll, we get 

ApUax{d„d„} E [f{x)-f{y) + {l-f{x))-{l~f{y))] 

a;G{ — 1,1}" 

> J(E[5(x)], E[g(y)]) + J(E[1 - g(x)], E[l - g{y)]) - 2^ (21) 

Next, we recall the following fact : 

Fact 6.15. For any a e (0,1) and pe (-1,0), 

Jp{a,a) + Jp{l-a,l- a) > 2Jp(l/2,l/2) = 1 - ^ISSSIE 
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Combining this fact with Claim lBTl we have that for every x e [e, 1 — e], 

~, ^ ~, , arccosp' 
J{x, x) + J{1- x,l~ x)>l — - 2e 

TT 

By using Corollarv lC.lOl we have that there exists ds = ^^(e, p') such that 

A, h,, J(E[g(x)], E[g(y)]) + J(E[1 - 5(x)], E[l - g[y)]) > 1 - ^^^^ - 4e (22) 

TT 

Combining (l2Tl i and (|22] |. we get 

^max{d„d„.d.} E [/(x) • f{y) + (1 - /(x)) • (1 - fiy))] > 1 - ^122211. _ I4e - 8 • c^V^ 

xe{-l.l}" TT 

_ l^j^ilntr^nr^ (23) 

From here, getting to Theorem |6.1| is pretty easy. We proceed as follows : 

• For the given p and k, first we choose e = k/100. This implies that 14e < k/4. 

• Next, observe that c^(p', e) is a uniformly continuous function of p' and e. Now, recall that p' = p/{l — rj). 
Hence, there exists 770 = ?/o(P: e> ^) such that for all r] < r/o, • c^ip' , e) < k/32. 

• Again, observe that for any p E (—1,0) and k > 0, there exists 771 = rji{p,K) such that for all 77 < 771, 
(arccos/9')/7r < (arccosp')/7r + k/4. 



Now, choose 77 = min{77o, 7/1}. With r; and e having been fixed in terms of k and p, we set dp (k, p) ~ max{(i^, da, d^}, 
c(k, p) = ^ Jif ' ^'^'^ c'll'^j p) and hence get 

A, U(K.rt E [/(x) • m + (1 - /(x)) • (1 - f{y))] > 1 - - - c(^, p) ■ f f](lnf f^^C^'") 



This finishes the proof of Theorem l6.1l 

7 Refuting the Khot-Vishnoi instances of MAX-CUT 

In this section, we will prove the following theorem : 

Theorem 7.1. Let p e (—1,0) and Gp = {Vp,Ep) be the Max-Cut instance constructed in IIKV051 for the noise 
parameter p. Let {xtjj^gy be a sequence of indeterminates and A = Ut,gv/{0 < x^ < 1}- Then, for any (5 > 0, there 
exists di = di {6, p) such that 

Aul E a;„ • (1 — Xu) + x„ • (1 — x„) > — arccos p + 1 hdi — 1 > 

L(u,«)e-E TT J 

For this section, it is helpful to begin by recalling the following theorem of O'Donnell and Zhou IIOZ13II . 

Theorem 7.2. IIOZ13I Lef { f fa:)}^g{_i i}n beasequenceof indeterminates and let A = y^xe{-i.i}^{^ £ ^ 
Then, for any 5 > Q and p G (—1, 0), 

A ho(i/5.) Stab,(/) > K{p) - 5 - 20(i/5^) . Ji(^^ 

where K{p) = i + ^ + (1 - i) . p3 
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The main application of Theorem |7.2| in IIOZ13I is the following : Khot and Vishnoi IIKV05I construct instances 
of MAX-CUT (parameterized by noise parameter p) whose optimum is bounded by (arccosp)/7r + o(l) (and yet the 
basic SDP relaxation for MAX-CUT has value (1 — p)/2.) O'Donnell and Zhou essentially use Theorem 17.21 as a 
black-box to give a constant degree SoS proof that these instances have optimum bounded hy 1 — K{p) + o{l). This 
improves significantly on the bound provided by the basic SDP. 

In this section, we will show how we can use the stronger version of Theorem 17.21 namely Theorem 16. II to do 
even better. In particular, we will use this theorem to give a constant degree SoS proof that these MAX-CUT instances 
have optimum bounded by (arccos p)/7r + o(l) (which is of course tight). We will not give all the details of this proof 
as our proof will follow the (by now, standard) reduction from [KKMO07!| and its SoS variant from IOZ13i . 

We begin by recalling the description of instances of UNIQUE-GAMES (UG). A UG instance is specified by 
a set of vertices V and an alphabet [k]. Along with this, there is a probability distribution £ on tuples of the form 
{u, V, TT(u,v)) with 7r(„ : [k] — > [k] being a permutation. Further, the weighted graph defined by £ is regular. Also, 
let £u denote the marginal distribution on {v, tt) when the first vertex is conditioned to be u. The objective is to get a 
mapping L : V ^ [k] so as to maximize the following quantity : 

Pi"(«,!;,7r(„,„))e£[.C(w) = 7r(„_„)(£(w))] 

We next consider the SoS formulation for the UG instance described above. It is slightly different from the ' 'obvious" 
formulation and follows the formulation in IIOZ13I . In particular, we define variables Xv,i for every v E V and i E [k]. 
Now, consider the set of constraints defined by 

vev,ielk] vev [iG[fe] 

It is easy to show that if the optimum solution to the Unique Games instance is bounded by (3, then 

k 

E [V( E < 4/? 

l—l ^ ■ ' 

for any set of indeterminates {x^, i} which obeys the constraint set A^. We now make the following definition : 
Definition 7.3. Given a UG instance (V, £) with alphabet size k, there is a degree-d SOS refutation for optimum /3 if 




k , X 2' 

/A E ^V.TV^, ^(i) 



,4=1 



>/3^ >0 



Before we go ahead, we recall that for any r\ E (0, 1) and N G N (which is a power of 2), IIKV05I construct 
UG instances over 2^^ /N vertices, alphabet size n such that optimal value of the instance is bounded by N^^. 
Modifying the result from llBBH+121 . O'Donnell and Zhou IIOZ13I show the following : 

Theorem 7.4. Let i] G (0, 1) and N be a power of 2 and let (V, £) be the corresponding instances ofUG constructed 
in HKVOSV . Then, there is a degree- A: SoS refutation for optimum /3 — N~^'^^\ 

We next describe the reduction from IIKKMO07I of UG to MAX-CUT. The reduction is parameterized by a 
"correlation" value p E (—1, 0). Given the instance of UG described above, the set of vertices in the corresponding 
MAX-CUT instance is given hy V' ^ V x {—1, l}*^. Further, the probability distribution £p,k over the edges is given 
by the following sampling procedure : 

• Choose u ^ V uniformly at random. 

• Choose (m, wi, TTfu and (u, U2, 7r(„ i,^)) independently from the distribution Eu which is defined as the 
marginal of E conditioned on the first vertex being u. 

• Choose X £ { — 1, l}'^ and y ~p x. 



Of course, the interesting part is tiiat IKV05I siiows tliat the standard SDP relaxation on this instance has value 1 — r] 
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• Output vertices 7r(„,„^)(a;)), (u2, 7r(„^„^)(y))) 
Now, for a function g : { — 1, 1}'"' [0, 1], let us define Stabp(g) as follows 

Stabp(g) = E [gix) ■ g{y) + (1 - g{x)) ■ (1 - g{y))] 

We have the following simple claim : 

Claim 7.5. IIKKMO07I Let G' — {V ^ Ep_k) be an instance of MAX-CUT described above. Consider a partition of 
the graph G' (into two sets) specified by a collection of functions {fy : { — 1, l}*^ — >■ {0, 1}}. Then, the value of cut 
defined by this partition is 1 — Euev [Stabp((7„)] where 

gu : {-1,1}'' [0,1] isdefinedas gu{x) = E [/t,(7r(a;))] 

(t>,7r)6£„ 

Consider the SoS relaxation of the MAX-CUT instance defined by V' and £p,k- In particular, we have an indeter- 
minate fv{z) for every v & V and z G {—1, l}''- The constraint set Am is given by Am = U^gy ^ze{-i.i}'' {0 — 
fv(z) < 1}. Then, O'Donnell and Zhou IIOZ13I show that if {V,£) is a UG instance such that there is a degree 4 
refutation for the optimum /3, then 

Am U {1 - E [Stabp(g„)] > K{p) - S - 20(i/*')/3} ^o{i/s^)+4 -1 > (24) 



This of course means that the if the BKKMQQ7I reduction is appHed on the instances from Theorem l7.4l 

AmU{l- E [Stabp(gJ] > K{p) - 6 - 2^'^^""^ ■ iV-^'")} ^o[i/s-)+a -1 > 

Exactly following the same steps as IIOZ13I . but using Theorem l6.1 [ instead of Theorem l7.2l we show that for any 
(5 > 0, 

Am U {1 - E [Stabp(g„)] > {a.vccos p) / it -5- d2{5, p)c{5, p) ■ (5} ^d^(5.p)+i -1 > (25) 
We do not repeat the steps here and leave it to the reader to fill the details. Using (5 = N^^^^'^\ we get Theorem l7.1l 
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A Facts regarding Jp 

Here we collect various facts about the function 



where (X, Y) ^ A/'(0, (pi))- These calculations all follow from elementary calculus. 

Claim 3.6. For any {x,y) G (0, 1)^ and < <J < p, Mp^{x,y) is a negative semidefinite matrix. Likewise, if 
p < cr < 0, then Mp^{x, y) is a positive semidefinite matrix. 

Proof. Towards proving this, note that we can define Y = p ■ X + y/l- p^-Z where Z - 1) is an independent 
normal. Also, let us define <i>^^(.T) = s and $^^(y) = t. For s, t e R, define Kp{s, t) as 



Kpis, t) = VrxAX <s,Y<t] = Prx,z[X <s,Z<{t~p- X)/^l-p^] 
Note that for the aforementioned relations between x, y, s and t, Kp{s, t) ~ Jp{x, y). Note that 



Kp{s,t)^ (j>{s') cl){t')ds'dt' (26) 

Js' — — 00 Jt' — — CG 



This implies that 



dKp{s,t) 



(t-p-6)/yi3^ 
(j){s) I (j){t')dt' 



By chain rule, we get that 



ds 

dJp{x,y) _ dKp{s,t) ds 

dx ds dx 

By elementary calculus, it follows that 

d<^-\x) 1 ds _ 



dx 0($-i(x)) dx ?i($-i(a;)) (f>{s) 

Thus, 

dJp{x,y) 



dx 



(t-p-s)/Vi-P= 

(j){t')dt' 



Thus, we next get that 

d^Jpix,y) _d^Jpix,y) ds t-p-s\ -p 1 _ /$-i(y) - p • $-i(x) 



dx^ dxds dx ) <Pis) \ J </*(«) 

d^Jp{x,y) _ d\Jp{x,y) dt _ U-\y)- p-<i>-\x) \ I J_ 

dxdy dxdt ' dy "^y j ' ' m 

Because we know that {X, Y) ^ (Y, X), by symmetry, we can conclude that 

d^Jp{x,y) _ ( ^-\x)~p-^-\y) \ ~p 1 
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and likewise. 



It is obvious now that 



dydx 



d^Jp{x,y) d^Jp{x,y) _ ( d^Jp{x,y) 



dx^ 



dy"^ 



dxdy 



Now, suppose that \a\ < \p\. Then 

dct{Mpa{x,y)) 



d'^Jp{x,y) d'^Jp{x,y) ^(d'^Jp{x,y) 



dx^ 



dy^ 



dxdy 



> 0. 



If p > then the diagonal of Mp„{x, y) is non-positive, and it follows that Mpa{x^ y) is negative semidefinite. If 
p < then the diagonal is non-negative and so Mp„{x, y) is positive semidefinite. □ 

Claim 3.7. For any — 1 < p < 1, there exists C{p) > such that for any i, j > 0, i + j — 3, 

d^Jp[x,y) 



<C(p)(xy(l-x)(l-y)r'^('') 



d^xd^y 

Proof. As before, we set $"^(2;) — s and $^^(?/) — t. From the proof of Claim |3T6l we see that 

d^Jp{x,y) _ ^(^-\y)- p-'i>-\x)\ -p 1 



To compute the third derivatives of J, recall that || = and |^ = we have 



d^Jp{x,y) 
dx^ 



p pt+ {2p^ - l)s 
V2^p 



■ cxp 



(l_p2)3/2 



{pt+{2p^ - l)s)cxp 



y we have 
- 2pst + (2p2 - l)s2 

<2 - 2pst + (3p2 - 2)5^ 
2(1^7^ 



Now, $ ^{x) ^ y/2 log a; as a; — >■ 0; hence there is a constant C such that $ ^(2;) < CVToga* for all x < ^. Hence, 
exp(s^) < x^'^' for all x < ^;hy symmetry, exp(s^) < — a;))^*^ for all x G (0, 1). Therefore 



exp 



t"^ - 2pst+ (3p2 - 2)s^ 
2(1^7^ 



pat (2-3p^)3^ 



= e SCl-p-i) g l-p-! g 2(l-p^ 



< e 2(l-p2) g 2(l-p2) g 2(l-p2 

P 2-3p^ 

< {x{l - x)y{l - y)) 2(i-p") - x)) 2(1-"^) 



Applying this to dZTb . we see that 



d^Jp{x,y) 



dx'^ 



<C{p){x[l-x)y{l-y)) 



-C{p) 



The other third derivatives are similar: 

d^Jp{x,y) _ \f2^p 



dx^dy (1 - p2)3/2 
By the same steps that led to ( |27] |. we get 



— 2ps) cxp ^ 



(2p2 - \)t^ - 2pst + (2p^ - 
2(1 



dx^dy 



<C{p){x{l~x)y{l~y)) 



(for a slightly different C{p)). The bounds on d^J/dy^dx and d'^J/dx^ then follow because J is symmetric in 2: and 
y. □ 
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Claim A.l. For any x, y G (0, 1), 



dJp{x,y) 



dp 



<(1-P^) 



2\-3/2 



Proof. We begin from (|26] |. but this time we differentiate with respect to p: 



dKp{s,t) 



1 



dp (1 - p2)3/2 J^, 

Since < 1 and 4){s')ds' = 1, it follows that 

dKp{s,t) 



t - ps' 



ds'. 



dp 



<(1-P^) 



2\-3/2 



Smce — a „ = — — wTr the proof is complete. 



dp 



dp 



□ 



B Approximation by polynomials 

Claim B.l. For any p G (—1, 1) and any S > 0, there is a polynomial J such that for all < i + j < 2, 

d'+Upix,y) d'+iJp{x,y) 



sup 



dx^dyi 



dx^dyi 



< S. 



Moreover, if p (z [—1 + e, 1 — e], then the degree of J and the maximal coefficient in J can be bounded by constants 
depending only on e and 6. 

The proof of Claim [BTI follows from standard results on Bernstein polynomials. In particular, we make use of 
the following theorem which may be found, for example, in ||Lor86l . 

Theorem B.2. Suppose f : [0, 1] — S- M has m continuous derivatives which are all bounded in absolute value by M. 
For any n € N, let Bnf be the polynomial 



Then for any < i < m. 



{B^f){x) = J2f{k/n)i^^jx\l~x) 
d'f{x) d\B^f){x 



n—k 



sup 

a;G[0,l] 



da;* 



dx"^ 



Seeing as the first three derivatives of Jp are bounded on [e, 1 — e]. Claim IbTI is essentially just a 2-variable 
version of Theorem lB.2l Although such a result is almost certainly known (and for more than 2 variables), we were 
not to find a reference in the literature, and so we include the proof here. 

Proof of Claim \BJ\ Suppose that / : [0, 1]^ ^ M has all partial derivatives up to third order bounded by M . Define 

gr.{x, y) = {B,J{-^y)){x) = Q f{k/n, - x)""^ 



n n 



hn{x, y) = {Bngn{x, ■)){y) = X! X! 
Fix < i + j < 2 and note that 



fc=l £=1 



kj\i 



fik/n,e/n)x'il~xr-'/il~y) 



dyi 



dy3 



d'-hnix,-) d'-gnix,-) 

= -On 



dx^ 



dx^ 



(27) 
(28) 
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Now, fix y £ [0, 1] and apply Theorem |B.2| to ( |27] i: for any x G [0, 1], 



dx'^dyi dx^dyi 
On the other hand, fixing x and applying Theorem |B.2| to ( |28] l yields 



dx^dyi 



dx^dyj 



Putting these together, 



d'+iK{x,y) d'+^f{x,y) 



dx^dyi 



dx^dy^ 



< 2C^/M/n. 



Since /i„ is a polynomial, taking n sufficiently large implies that there is a polynomial / such that /, and its partial 
derivatives of order at most 2, uniformly approximate the corresponding derivatives of /. Although we stated this for 
functions on [0, 1]^, a change of coordinates shows that it holds equally well for functions on [5, 1 — 5Y with three 
bounded derivatives. Since Jp is such a function, the first part of the claim follows. 

For the second part of the claim, note that all of the error bounds hold uniformly in p e [— 1 + e, 1 — e] since the 
third derivatives of Jp are uniformly bounded over p £ [— 1 + e, 1 — e]. Moreover, since max^^^y \ Jp{x, y)\ < 1, the 



coefficients in /i„ can be bounded in terms of n, which is in turn bounded in terms of e and 5. 



□ 



C Useful facts in SoS hierarchy 

Fact C.l. If A ^dP>QandA^q>Q, then A hd p + g > 0. 
Fact C.l. If A = {-1 < y < 1}, 

• Ifkis an even integer, A hfe+i < J/'^ < 1 

• Ifkis an odd integer, v4 — 1 < y*^ < 1 

Proof. Note that for A: = 0, 1, the conclusion is trivially true. For k = 2, note that trivially, > 0. So, we begin by 
observing that (from IIOZ13I ) 

i-2/' = ^(i + yf(i-y) + ^(i + y)(i-yf 

and hence < y < I y^ < 1. This finishes the case for fc = 2. For the remaining cases, we use induction. 
We first consider the case when fc > 2 is even. Then, trivially, we have y'^ > 0. Also, observe that 1 — y*^ = 
y^(l — y*^"^) + (1 — y^). Hence, by induction hypothesis, we have A \-k+i y'^ < 1. 

Next, consider the case when fc > 2 is odd. Again, as 1 — y'^ = y^(l — y^^"^) + (1 — y^), hence by induction 
hypothesis, we get A hj; y'^ < 1. Also, note that 1 + y^ = y^(l + y^^"^) + (1 — y^). Hence, again, by induction 
hypothesis, we get A h^. — 1 < y^ □ 

Fact C.3. -1 < y < 1 1-5 y** < y^ 
Proof. 

- / = + y)'(i - V) + ^(1 + y)(i - vfy" 

This finishes the proof. □ 

Fact C.4. Leta <y <b\-d p{y) > 0. Then, for\i,...,\k G R such that Vi G [fc], Aj > and Y^l^^ Xi = 1, we 
have that {ut^O < Z, < 1} > 0. 

Proof. In the SoS proof of > 0, whenever the term (6 — Y) appears, we simply substitute it by X^iLi Xi{b — Zi). 
Likewise, whenever the term {Y—a) appears, we substitute it by X^iLi Xi{a—Zi). It is easy to see that this substitution 
shows that {uJ^iO < Z, < 1} ^^Z,) > 0. □ 
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Fact C.5. Fork> 3, -1 < y < 1 hzfe+i < y^^ < y4 

Proof. 1 — = i(l + — y) + ^(1 + y)(l — y)^ and hence — 1 < y < 1 1^3 y^ < 1- As a consequence, for 
any j > 1. we can get that — 1 < y < 1 ^2j+i y^-' < y^''~^- Summing all the inequalities as j variables from j = 3 
to j = k, we get the stated inequalities. □ 

Fact C.6. For integers m, n > 2, {-1 < y < 1, -1 < z < 1} hi+,„ax{2m,2n} ~iy* + z^) < y"'z'' < (y^ + z^). 



Proof. h„iax{2m,2n} J/^™ + z^" > v"' z'\ Also, using Fact|c3] -1 < y < 1 \-2,n+i > y^"- Similarly, we have 
-1 < z < 1 l-2m+i > Combining these, we have {-1 < y < 1, -1 < z < 1} l-i+max{2m,2n} y™^;" < 
(y^ + z^). Replacing y by -y and z by -z, we can similarly get, {-1 < y < 1,-1 < z < 1} l-i+max{2m,2n} 
_ym^n ^ ^ ^4-1 j-^^^ completes the proof. □ 

FactC.7. For any odd integer n> 3, {-1 < y < 1,-1 < z < 1} |-„+2 -(y'' + z"^) < yz" < (y'' + z''). 

Proof. We use A to denote {—1 < y < 1,-1 < z < 1}. We first use Fact 3.10 from IIOZ13I which states 
that A I-4 yz^ < y^ + We can replace y by — y to get the other inequality. This already gives the proof 
for n = 3. For n > 3, we have that A h„+i yz" < y^z""'^ + z"+^. Now, using Fact IC.2I (Item 1), we get 
A h,i+2 z"+^ < z^. And similarly, we get A h„_2 z"^'^ < 1 and hence A h„+2 z"^'^y"' < y^. Combining these, we 
get that A h„+2 y^" < y"' + ^"^^ Replacing y by — y, we get the other side. □ 

Fact C.8. Let A h^^ < x < 1 and A \-d2 ^z < y < z where z e M<(i[X]. Then, A l^di+maxfdo.da} ~z < xy < z. 

Proof. Note that z — xy — z(l — x) + x{z — y). Now, A h(ji+d2 ^{^ — y) > and ^ l^ds+di z{l — x) > 0. 
Combining these, we get that A l^dj+maxfrf^.ds} 2;y < z. Flipping y to — y, we get the other inequality. □ 

FactC.9. teBH+1211 Let 71, c? G N and d < n. For every S C [71] such that \S\ < d, we have an indeterminate i{S). 
Forx e {-1, 1}", define e{x) = Esc[„]:|S|<d ^('5)xs(a;). Then, 

2 



^4 E [eix)]<r{ E [e\xy 

xe{-i,i}" V2:e{-i,i}" 

Consequences of Putinar's Positivstellensatz The following is a consequence of Theorem l2.4l 

Corollary C.IO. Let X = (xi, X2) and A = {xi > e,X2 > < 1 — e,.T2 < 1 — e}- Then, for any p{X) such that 
p{X) > e on V(A), there exists an integer d = d{p) such that A he; p > e/2. 

Proof. We can define the polynomials pi = .xi — e, p2 = 1 — e — P3 = 2^2 — e and p4 = 1 — e — .T2. Now, note 
that q{x,y) defined as 

q{xi,X2) = (l-e-xi)'^ -pi + ixi-e)"^ ■p2 + il-e-X2)'^ ■P3 + ix2-ef -pi 



Clearly, q{xi, X2) E M(5') and that the set {{xi, X2) : q{xi, X2) > 0} is a compact set. As a consequence, we can 
apply Theorem l2.4l to get that there is an integer d ~ d{p) such that p — e/2 > 0. This implies h p > e/2. □ 

The following is a corollary of Theorem l2.5l 

Corollary C.ll. Let X = {xi, . . . ,x„) and A ~ {pi{X) > 0, . . . ,Pm.{X) > 0} be satisfying the conditions in 
Theorem^ Let T g be such that for x € W{A), T t SI for some S > 0. Let v G (M[X])?'. Then, if 

p = ■ T ■ V, then p g M{A). 

Proof. First, by applying Theorem |2.5l we get F = ro(X) + X]i=i ' Pi {^)- Let us assume that F, = Bj ■ B,. 

Then, 

m m 

p = v'^ ■T-v = «^(ro(X) + r.(^) • P^{X))v = (Bo • vf ■ {Bo ■ v) + ^(Bo • vf ■ {Bo ■ v) ■ p,{x) 

i=l i=l 

This proves the claim. □ 
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D Missing proofs from Section |6] 

Proof (of Claim |6jl: 

fii^)h{y) - f{^)fiy) - (e' - 2e)/(x)/(j/) + J + e(l - + /(y)) 

Hence, ha /i (y) -f[x)f{y) < 2e. On the other hand, note that 1 - /(x)/(y) = (1 - + (1 - /(y)) 

and hence Ap f{x)J{y) < 1. This implies that ha !{x)f{y) - h{x)fi{y) < 2e. □ 

Proof (of Claim I6l7] |: We begin by observing that 

jihoio) + X, r.io) +y) = Y^ • (ho{o) + xr ■ (fnio) + yr = Yl • ^"^ • y" 

m,n m,n 

As a consequence, we get that 

l^7n,n ■ ho (1) • hi (1) 

j(/?;(o) - ho{i)Xi{o) - hi{i)) = ;^(-i)'"+v„,„ • ho"'{i) ■ C(i) 

Adding these equations, we get 

j(M0) + Mi),M0) + /h(i)) + J(M0)-/^(i),M0)-Mi)) ^ '^m,™ -^"(i) -^"(i) (29) 

m+n is even 

Similarly, we have that 

j(Mo) - /h(o) + /^(i)) = ^(-i)"v„,„ • /^"(i) • C(i) 

m,n 

Thus, 

j(V0)-Vi),M0)+Mi))+>/(/^(0)+/^^(i),M0)-M ^ (-irz/„,„.C'(i)-/u"(i) OO) 

m+n is even 

Hence, combining ( [29l ) and ( f30l ), we get that 

E [J(/io(x),/.i(2/))]= 5] ('i±^ + (_l)™.i_^') 

xG„j-i,i} V 2 2 y 

y~p/ a; m+n is even 

□ 

Proof (of Claim l6l8l t: For m + n > 4 and mi > m and ni > n, we define „j _m.„ as follows : 

r^,,nu^.n = Mm:,n, f J j f M ' /^O (1) ' h, (1) • h„ (0) • /ii (0) • (—^ + (-1)'" • 

Next, define the set of constraints Am as 

= {0 < ho{0) < 1 , < Ti{0) < 1 , -1 < hn{l) < 1 , -1 < < 1} 

Now, it is easy to see that A hi Am- Hence, by using the third bullet of Fact l2.3l if for any p and d G N, Am h^ p > 0, 
then Ahd p > 0. We shall be using this fact consistently throughout this proof. Applying Fact lC.2l we get that 

- — -mi— m -^-ni— n 

A hm,-rn+l < Hq (0) < 1 Ah 

ni — 71+1 

0<hi (0) < 1 
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and hence we have that 

0<ho (0) • hi (0) < 1 (31) 
Now, we consider two possibilities : Either m, n > 2 or ma.x{m, 7i} > 3. In the first case, using Fact |C.6l we get that 

-^4 - — -4 - — -ri - — -4 -^4 

A hi+,„ax{2™,2„} -{ho (1) + hi (1)) < h„ (1) • hi (1) < ho (1) + hi (!) (32) 

Next, consider the other case i.e. when niax{m, n} > 3. Without loss of generahty, assume m > 3 and n — 1. Then, 
by Fact |C.7| we get that 

^-i ^-i ^-rn ^~4 -^4 

A Kn+2 -{ho (1) + hi (1)) < /.o (1) • hi (1) < /lo (1) + hi (1) 
Now, combining < [3T] t along with an appHcation of Fact lC.81 we get that 

+ni+3 ^e^O (1) + ^1 (1)) < /iQ (1) • '^1 (1) • (0) ■ /ll (0) < ^0 (1) + ^1 (1) 

Now, recalUng that Q < mi,ni,m,n < K, (™') , ("^') < 2^, |^„i,„| < c and \p'\ < 1, we get that 

J- 7^^l ,ni ,m,n <c22^(/io (l) + /ii (1)) 

As, 



mi ,ni ,m,n 

m,n:m+n is even ^ ^ m,n:m+n is even 

and m+n>4 m+n>4 

K>mi>m K>ni>n 

As a resuh, we can conclude that 

A h4K+3 -c22^i^^(C(l) + /^'(l)) < 51 < c22^i^*(C(l) + 

m,n:m+n is even 
and m+n>4 

□ 

Proof (of Claim l6l9l l: We begin by defining the matrix M as follows : 



M 



d^Jjx y) I d'^J{x,v) 
dx^ ^ dxdy 

I dxdy dy-^ 



Put /3 = 2e. Now, let us define Mi M + /?/. Using Claim[321and ClaimlO for p' e (-1,0), and {x,y) G 
[e, 1 — e]^, we can say that Mi > e ■ I. Hence, using Corollarv lC.llI 3d'^ such that we have the following 

= ^ hrf; /lo (1) • + 2p • ho{l) ■ hi{l) ■ g^g^ + hi (1) • ^ -/^(^o (1) + hi (1)) 

Dividing by 2 on both sides, finishes the proof. Note that the reason the degree d'^ depends only on e and p is 
because from Corollarv lC.l II the degree d'^ depends on e, p and the polynomial J which again in turn depends only 
on e and p. □ 

Proof (of Lemma [6.10) : The proof is by induction. We introduce the following notation : For any z G {—1, 1}\ 

we use 1 • z G { — 1, 1}'^^ to denote the string z with a 1 prefixed to it. Likewise, we define — 1 • z G {—1, 1}*^^ to 
denote the string z with a —1 prefixed to it. Next, for any pairs of strings z_i, zi G { — 1, 1}* and j, k G { — 1, 1}, we 
define the indeterminate, hi{j) = E^^^_i^i}n.-i-i[g{z ■ j ■ z.^)]. Define A^_^^^^ = Lij^ke{~iA}U < hi{j) < 1 - e}. 
It is trivial to see that A'phi Az_^.z^. 
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Now, using Lemma |6^ for any two strings zi, z_i G { — 1, 1}*, we get that 

xe{-i,i} ^ 



As a consequence of the third bullet of Fact 12.31 the left hand side of 'h' can be replaced by A'^. Now, for any 
given zi, z_i e { — 1, 1}\ let d{zi, be its Hamming distance. We multiply the inequality by (^khLy-dizi,z^i) . 
(i^^^)'^'^^^'^-'^). Note that we will consider the case when i = and hence zi and z_i is the empty string. In 
this scenario, d{zi, z_i) is defined to be zero. We now consider all the above inequalities generated by choosing 
{zi, Z-i) G {—1, 1}' X { — 1, 1}' for < I < n. Next, we add all these inequalities (Fact IC. il l but the degree of the 
resulting SoS proof remains d-y. Now, it is easy to see that all terms of the form : J(E[gz^], E[gz_ J) cancel out except 
when zi, z_i G { — 1, 1}" or zi = z_i = 0. 

E [J{gix),g{y))] > J(E[g(a:;)], E[g(y)]) + error terms 

xe{-i,iV 

We now compute the error terms. First, we sum up the error coming from the term e ^p^^ {n — i) + 'gl -^ {n — i) 
For any given zi G { — 1, 1}', consider the term pg1_^{n — i). For every z_i G { — 1, 1}*, it occurs with the factor 

— ^ • . Since there are exactly (^j strings z_i G { — 1, 1}* such that d{zi, ~ fc, hence 

we get that the total weight associated is 



fc=0 



Thus, we get that the first kind of error terms contribute e {^^^ {n — i) + 'gl_^ {n — The calculation of the ' 'fourth 
degree" error terms is exactly identical resulting in the final theorem. □ 

Proof (of Claim|02): We have 

he J2 E.e{-i,i}45^(^ " i)K{n " 0] < ^ E E.e{-i,i}'5^(^ - + ^ ^ 



i=0 



Next, recall that using Fact l6.1ll we have Ap I-3 X]"=o Ej;g{_i — i) < 77. Similarly, from ( fTSl l and Claim|i 

we have that Ap \- —1 < gx{n — «)<!. This in turn implies Ap hj g^{n — i) < 'gl{n ~ i) (combining Fact lC.3l and 
FactlOli 

Combining all the above, we get 

Ap^jY.^-<^{-^^y'(9l{n-i)K{n^i)] < ^ - ( ^ E,g{_i,i}.52(^ _ j ^ 
i=o " \i=o / 

However, again we have that I-3 Y^^=o ^x<^{-i,iY9x{'^ ^) — S|s|>o5^('^) — E[ff^(a;)] < 1. This gives us the 
claim. □ 

Proof (of Claim|03): We have 

h4 5^E.,{„i,i}.[g^(n-z)£^(n-z)]<^ K]E,,{.i,ip5^(n-"-^l ' ^ 



n—1 /n— 1 



h4 E.e{-i,i}.?^(ri - ^) < 9''" • (E,e{-i.i}.^(n - i)) (using Fact|C9) 



i=0 \i=Q 
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As in the proof of Claim |6T2l we can show Ap hs J27=o ^xe{-i.i}^dti''^ ^ < 1- Combining all the above, we get 



i=0 \i=0 J 

Again, observe that 

SC{n-!:,...,n}:n-4eS SC [ri] :ri-j:eS SC[ri]:n-iGS:|S|<d^ 

< E /'(5)<inff^n/) 

SC[n]:ri-ieS:|S|<d, 

By using the second bullet of Fact 12. 3 1 we can also get 

n— 1 2 n—1 2 

h4 5] (E.e{-i,i}^?.(^-o) < E (inf,f-n/)) 

1=0 1=0 

Combining these, we get the final result. □ 
Proof (of Claim|04): 

+ ^ 2 

From the proof of Claim l6?T2l we know that Ap [-3 X)"=o^ 'Eixe{-i,i}^ ^K*^ ^ *) ^ Thus, we get 

\?2, 



, ^ ^ xA/ Er=o Exei-Lii^.g^l^-^KxC'^- 

However, 

Er=o^ ^xe{~i,iY dlin E^e{-i,i}' ^li"^ " 



< 



e{-ia}' 2 
Following the same proof as in the proof of Claim l6?T2l we can show that 

n-l 

Ap^^Y. E ff^(7i-z)<l 
Similarly, from the argument in the proof of Claim [6^31 we can show that 

n—1 /n—1 2' 

1=0 \i=0 

Combining these, we have the proof. □ 
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